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Abstract 

Background: The exponential growth of gigantic biological data from various sources, such as protein-protein inter- 
action (PPI), genome sequences scaffolding, Mass spectrometry (MS) molecular networking and metabolic flux, demands 
an efficient way for better visualization and interpretation beyond the conventional, two-dimensional visualization tools. 

Results: We developed a 3D Cytoscape Client/Server (3DScapeCS) plugin, which adopted Cytoscape in interpreting 
different types of data, and UbiGraph for three-dimensional visualization. The extra dimension is useful in accommodating, 
visualizing, and distinguishing large-scale networks with multiple crossed connections in five case studies. 

Conclusions: Evaluation on several experimental data using 3DScapeCS and its special features, including multilevel 
graph layout, time-course data animation, and parallel visualization has proven its usefulness in visualizing complex data 
and help to make insightful conclusions. 



Background 

Cytoscape [1] is a free open source platform providing 
biological network analysis and two-dimensional (2D) 
visualization for biologists. With more than 172 registered 
plugins contributed by the community, it is very versatile 
in network applications, such as network importing, 
network integrating, inference customization, literature 
mining, topological clustering, functional enrichment, net- 
work comparison, and programmatic access [2]. However, 
it lacks the capability of displaying the large-scale net- 
works in three-dimensions (3D) or beyond. With the ex- 
ponential growth of gigantic biological data deposited in 
the public domain, the extra space visualization would 
offer more flexibility for layered representation [3] and 
heterogeneous data visualization [4]. Although a few appli- 
cations are available for 3D network visualization, such as 
BioLayout [5], Wilmascope 3D [6], Arena3D [7], FORG3D 
[8], the lack of community support has hindered their 
widespread use. Tulip [9] features 3D display and a 
variety of plugins, but it is far less versatile as Cytoscape in 
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biological studies. Moreover, some of them suffer perform- 
ance drawbacks when displaying large networks (Table 1). 
Although there are Cytoscape s 3D plugins using Processing 
library [10] and JOGL [11], they are still under develop- 
ment. There is also an approach using RCytoscape [12] and 
RGL to create 3D visualization for Cytoscape, but it is not 
well developed in terms of its layout performance and 
ease of use. Although the release of Cytoscape 3.0 [13] 
claims that through its OSGi engine, it enables user to 
switch Tenderers in Cytoscape. However, graph manipula- 
tion and layout algorithms are both required to be changed 
while switching 2D to 3D, which means excessive override 
of Cytoscape core modules. Taking into consideration of 
the difficulties faced in implementation, adopting a light- 
weighted external renderer is a better choice. Moreover, 
C++ implementation is often superior in performance 
than Java when it comes to layout large graph with more 
than ten thousands nodes. Therefore we choose UbiGraph 
[14], which is a 3D visualization tool supporting up 
to eight POSIX threads and multilevel force-directed 
graph layout of high performance and aesthetic pleasure. 
We developed 3DScapeCS, a Cytoscape plugin providing 
three-dimensional, dynamic, parallel network visualization 
for Cytoscape in client-server (C/S) architecture (in 
Additional files 1 and 2). Despite criticism that developers 
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Table 1 A comparison between available 3D network visualization tools 



Program 
name 


File format 


Operating 
system 


Programming 
language 


Source code 
availability 


Time course 
data support 


Layout 
algorithm 


Performance* 


Arena3D [7] 


Arena3D 


Platform- 
Independent 


Java 


yes 


yes 


Layered 


2,000 nodes, 
10,000 edges 


BioLayout 

Express 3D 
pj 

F0RG3D [8] 


nr \/\ a\ "t\/t r~ . .„u^ a i 

blr, XML, 1 XI, GraphML, 
Matrix, Expression 


Platform- 
Independent 


Java 


yes 


yes 


Fruchterman- 
Rheingold 


500 nodes, 2,500 
edges 


F0RG3D 


Windows, Linux 
and Mac OS X 


C++ 


yes 


no 


Force-directed 


4,000 nodes, 
40,000 edges 


Vaa3D [4] 


TIFF, SWC, CSV 


Windows, Linux 
and Mac OS X 


C++ 


yes 


yes 


Location 
based 


2,500 textured 
nodes and edges 


Wilmascope 
3D [6] 


Wilmascope, GML 


Platform- 
Independent 


Java 


yes 


yes 


Force- 
directed, etc. 


2,000 nodes, 5,000 
edges 


3DScapeCS 


any format supported by 
Cytoscape 


Platform- 
Independent 


Java 


yes 


yes 


Multilevel 
Force-directed 


10,000 nodes, 
250,000 edges 



^Approximate network size measured with minimum frames per second (fps) which enables smooth manipulations (drag, rotate, etc.) on an eight core Intel Xeon 
2.13 GHz workstation with an on-board Matrox G200 graphics card. 



favour in building 3D visualization tools rather than analyz- 
ing using such tools has arisen within the bioinformatics 
community [15]. 3DScapeCS has undergone extensive test- 
ing among molecular chemistry, genomics and proteomics 
studies, which showcased as the following (in Additional 
file 3). 

Implementation 

Visualization of three-dimensional networks 

3DScapeCS converts an existed Cytoscape network to a 
3D view as shown in Figure la. The conversion can retain 



almost all settings such as the colours, sizes, and shapes of 
all nodes as similar as those from the existing 2D view. 
The users can easily manipulate the 3D view by using 
mouse. Operations including rotation and zooming of the 
display have been implemented in UbiGraph. We have 
implemented network communication between UbiGraph 
and Cytoscape as well. On double -clicking a node in 
UbiGraph, its attributes will be shown in the Node/Edge 
Attribute Browser panel in Cytoscape (Figure lb). A 
searching box is provided for user to search the nodes/ 
edges by corresponding identifiers. Whenever the search 
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Figure 1 The 3DscapeCS interface. Running 3DscapeCS in Cytoscape. (a) 3DScapeCS control panel; (b) Selected nodes/edges are listed in 
Attribute Browser; (c) Test data converted into 3D view using UbiGraph; (d) Corresponding 2D view in Cytoscape. 
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succeeds, the node matched will be highlighted with a 
label in the 3D view (Additional file 4). 

Three-dimensional layout 

The layout algorithms are required to generate meaning- 
ful and aesthetic drawings of biological networks. The 
layout algorithm built in UbiGraph is force-directed 
based. It acts by balancing the repulsive force between 
nodes and attractive force from the edge. After several 
rounds of movement and annealing, it generates aesthet- 
ically pleasing graph layouts with all edges of nearly 
equal length. The force-directed layout algorithm imple- 
mented in Cytoscape has a complexity of 0(N 3 ) and 
low-quality local drawing [16], it is only suitable for net- 
works containing up to several hundred nodes. Other 
global force-directed algorithms such as Fruchterman- 
Reingold [17] experience the same bottleneck. Therefore 
local force-directed algorithms such as Fast Multipole 
Multilevel Method (FM3) [18] and Multilevel layout al- 
gorithm [14] have been developed. Hereby we adopted 
the multilevel layout algorithm in UbiGraph for larger 
networks. In this approach, the network is partitioned 
into sub-networks. Only forces within the scope of sub- 
networks are calculated, so that the final graph layout 
can be obtained in 0(N/ogN) runtime. Therefore it will 
greatly reduce the duration required to complete three- 
dimensional layout, which is significantly helpful in 
visualization of large-scale networks (Figure lc). 

Time-course data animation 

Cytoscape supports the mapping of gene expression data 
to node color, label, and border thickness. With Vista- 
Clara [19] or SpotXplore [20] plugin installed, the user 
can distinguish expression intensities from a heat map 
themed node colour scheme. VistaClara even creates an 
animated view of the network for displaying the changes 
over time in a series of experiments. In our implementa- 
tion, users are able to explore expression data using Vis- 
taClaras heat map scheme or user-defined color scheme 
(Figure 2). Moreover, time-course data can be attributed 
to node sizes or edge thickness, making it possible for 
visualizing various data types, such as data from meta- 
bolic flux analysis, with thicker edge representing larger 
flux, and vice versa (Figure 3). We also implemented mo- 
tion network in 3DScapeCS, made it possible to study the 
changes over time or different conditions, such as mass 
spectral molecular networks (Figure 4). 

Parallel visualization 

3DScapeCS provides an approach for visualizing one 
network from multiple aspects by adopting parallel 
visualization. The Cytoscape network view can be simul- 
taneously rendered in UbiGraph on multiple computer cli- 
ents through network communication. Different clients 



can be set to visualize different aspects of the graph, either 
a reversal of perspective, or another time-point from a 
time-course animation. Unlike CytoscapeRPC [21], which 
use XML-RPC to modify networks in Cytoscape from cli- 
ents, UbiGraph clients serve as XML-RPC servers in 
3DScapeCS architecture. Therefore any changes made to 
the network view in Cytoscape, either change the size/ 
colour of a node, or add/delete a node, can be reflected on 
all clients by synchronizing the network data between Ubi- 
Graph clients and Cytoscape. 

Results 

Genome sequences scaffolding visualization 

Next-generation sequencing (NGS) and third-generation 
sequencing technologies such as PacBio SMRT greatly 
facilitated whole genome sequencing. However, com- 
plex genomic structures that cause sequence bias even 
with high genomic coverage and repeated sequences 
that may cause gaps in assembly still hamper gap clos- 
ure. Determining the relationships between contigs or 
scaffolds is therefore very important. Showing straight- 
forward graph-based relationships of contigs in Cytos- 
cape rather than tables is more intuitive, and will help 
planning for further PCR validation. ContigScape [22] is 
a Cytoscape plugin for helping visualizing genome scaf- 
folds. However, complex contig relationships in highly 
repetitive regions are hard to resolve since they have 
many intersections. By converting such network into 
three-dimensional graph, the relationships between 
contigs become easier to be identified. In the project of 
Ralstonia sp. genome scaffolding [22], three repetitive 
contigs have several connections which are hard to be 
distinguished, while nodes are well dispersed and their 
relationships with other contigs are much clearer in cor- 
responding 3D view (Figure 5). 

Dynamic visualization of metabolic flux data 

Flux balance analysis (FBA) is a powerful tool in simulat- 
ing metabolism in genome scale reconstructions of meta- 
bolic networks [23]. Dynamic exploration of flux data will 
give user an intuitive guidance on metabolic changes. 
In 3DscapeCS, metabolic flux data visualization can ei- 
ther be reflected in node size (reaction network), or 
edge thickness (substrate-product network). Here we 
use E. coli's energy metabolism model [24]. To reduce 
network complexity, we only show relationship with 
flux distribution greater than zero. Node sizes are 
rendered according to their flux distribution, so that re- 
actions with greater flux can be easily identified (Additional 
file 5). We compare the reaction network between the 
initial state (Figure 6b) and the simulated state (Figure 6c). 
The reactions with changed rate are easier to be identified 
in the 3D view due to the nodes in the centre of the net- 
work are dispersed in the 3D space, while the nodes in the 
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Figure 2 Visualization of expression data. Comparing 2D and 3D expression data visualization using VistaClara and 3DScapeCS using 
VistaClara heat map scheme. (a,d) gaMRGexp, (b,e) gal4RGexp, (c,f) ga!80Rexp. 



2D view are overlaid with each other. The evaluations 
with substrate-product network were carried out be- 
tween 3DScapeCS and FluxMap [25], which is a VANTED 
[26] add-on for the visual exploration of metabolic flux 
(Additional file 6). Although it features time-point simula- 
tion or conditions iteration in flux visualization, it lacks 
the "auto-play" functionality as 3DScapeCS does. Simula- 
tion of flux knockout example of glycolysis pathway in 
FluxMap example package shows that 3DScapeCS is use- 
ful in showing metabolic flux changes between different 
genotypes (Figure 3). 

Mass Spectrometry (MS) molecular network visualization 

Molecular network has been used for the analysis of 
MS/MS data and provide a new perspective to study mi- 
crobial natural products [27]. However, an efficient ap- 
proach needs attention to interpret the similarities or 
differences between several conditions in biological ex- 
periments. Nguyen D. et al has applied Cytoscape to 
visualize MS network generated from several MS experi- 
ments [28]. Here we present an example of the application 
3DScapeCS in monitoring the changes of metabolomics 
of the human pathogen C. albicans after the treatments 
using different doses of avermetin, a potent antifungal 
natural product (Ren et al, unpublished data). In this 
experiment, we treated C. albicans with 400 (Figure 4a), 
100 (Figure 4b), 25 (Figure 4c), 6.25 (Figure 4d), and 
1.5625 (Figure 4e) ug/mL of avermectin, respectively. 
Meanwhile, the pathogen treated with DMSO was used as 
negative control (Figure 4f). The monitoring of the metab- 
olites would help in identifying the pathway affected by 
avermectin and subsequently interprets the medicinal 



mechanism of avermectin. A motion graph varies in top- 
ology enables dynamic exploration of molecule changes at 
different conditions. This example is included in a saved 
Cytoscape session named MS.cys (in Additional file 3). 
User needs to export the session into SIF-format network 
and the filenames in node attributes into Cytoscape node 
attributes file. Then use separate.pl to divide the SIF- 
format network into several sub-networks according to 
their filename attributes provided. The generated SIF - 
format networks from A to F are at different dose treat- 
ments. Using "Order Networks", user makes them a 
motion network (Figure 4). The topological changes are 
useful in distinguishing different molecules produced at 
different dose treatments (Additional file 7). The parallel 
visualization on multiple monitors and animation iterat- 
ing all the networks also facilitate more comprehensive 
comparison. As a result, the avermectin treated cells 
produced more molecules compared with negative control 
(Figure 4f), and the molecules were gradually reduced 
along with the dose decreased, indicating that avermec- 
tin affected some pathways of C. albicans to produce or 
accumulate different molecules. Further identification 
of the different molecules from the node can help us to 
find the targeted pathway of avermectin. 

Detecting bubbles in De Bruijn graph 

De Bruijn graphs [29] are very common among Next- 
generation sequencing tools such as Velvet [30], ALL- 
PATHS [31], ABySS [32], Ray [33] and SOAPdenovo [34]. 
Assembly programs indexes reads in short words (/c-mers). 
Each node represents a series of overlapped /c-mers in de 
Bruijn graph, while arc represents their connection. Paths 
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traversing the graph are joined into contigs. Due to the ex- 
istence of biological variable and erroneous /<-mers, de 
Bruijn graph often forms bubbles, which is defined as sev- 
eral similar paths sharing the same start node and end 



node [30] . Bubbles are hard to resolve by de novo assem- 
blers, and therefore extension often ends up with an early 
termination in paths containing bubbles. In order to dis- 
tinguish such termination from sequencing gaps, it is 
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important in assessing the de Bruijn structure and detect- 
ing the bubbles. Here we evaluated a de novo assembly 
graph of Streptomyces avermitilis (Zhuo et aL, unpub- 
lished data), which was assembled using Velvet vl.2. 
We converted the LastGraph produced by Velvet into 
SIF-format network with 1,891 nodes and 1,921 edges 
using a Perl script debruijn.pl (in Additional file 3). 
While the two-dimensional network visualization failed 
to show the bubbles between nodes after force-directed 
layout in Cytoscape, the three-dimensional visualization 
indicates there are three bubbles in the highlighted re- 
gions (Figure 7). 



Discussion 

Three-dimensional visualization provides novel insights 
for Cytoscape. With time-course, colour, size, shape 
customization, 3DScapeCS can support up to seven di- 
mensional data visualization, making it useful in present- 
ing large-scale complex data. Although external Tenderers 
such as UbiGraph can add 3D functionalities to Cytos- 
cape, the operations of network are limited in UbiGraph. 
For example, user can neither drag vertices or edges in the 
graph to set locations, nor set the parameters of the multi- 
level layout algorithm, which make it less effective in het- 
erogeneous 3D data visualization. Therefore, in the future, 
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Figure 6 Visualization of metabolic flux data in a reaction network. Metabolic flux data for E.coli metabolism are visualized in a reaction 
network, (a) E.coli metabolism network in Cytoscape. (b) E.coli metabolism network in 3DScapeCS. (c) E.coli metabolism network with simulated 
data. Each node represents a reaction, with its size corresponding to the reaction rate. The changed reaction found to compare are highlighted 
with red circles. 




Figure 7 Visualization of de Bruijn graphs converted from 
velvet. Comparing 2D and 3D de Bruijn graph created from 
LastGraph file generated by Velvet. The bubbles found to compare 
are highlighted with red ellipse. Bubbles are not clear due to they 
were overlaid with other nodes or edges (upper) in Cytoscape. They 
become clear when rotated to a different perspective in 
3DScapeCS (lower). 



a more powerful renderer featured with more layouts and 
subcellular localization networks, such as neuronal net- 
works, should be implemented. Moreover, mobile devices 
such as iPad or Android Pad, which can be manipulated 
using fingers and gestures, are more suitable to manipu- 
late 3D view. So it is also desirable to implement Tenderers 
on those platforms for wireless parallel visualization. 

Conclusions 

We have integrated UbiGraph and Cytoscape in 3DSca- 
peCS. The 3D perspective not only guarantees user a 
greater experience in visualization, but also offers more 
insight into Cytoscape networks. Parallel motion graph 
is useful in visualizing data obtained with different con- 
ditions, as well as different aspects in a single experi- 
ment. Such functionalities give full play to their 
strength on a three-dimensional platform. Therefore 
3DScapeCS has more advantages in large and/or com- 
plex network visualization as the study cases presented 
in previous paragraphs. 

Availability and requirements 

Project name: 3DScapeCS 

Project home page: http://scape3d.sourceforge.net/ 
Operating system(s): Platform-independent. 
Programming language: Java 

Other requirements: Cytoscape v2.8, UbiGraph needs 

to be started as its renderer. 

License: Lesser General Public License (LGPL). 

Any restrictions to use by non-academics: None. 

Additional files 



Additional file 1: The 3DscapeCS Program File. Put 3DscapeCS.jar 
into cytoscape/plugins/ folder. 

Additional file 2: The 3DscapeCS User Manual. 

Additional file 3: The 3DscapeCS Case Study Data Unpack the 
zip package using WinZip, WinRAR, 7-zip or unzip command. 

See REAME.txt inside the package on how to use the test data. 
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Additional file 4: The 3DscapeCS Movie - Basic Operation and 
Expression Data Visualization. 

Additional file 5: The 3DscapeCS Movie - Reaction Network Motion 
Visualization. 

Additional file 6: The 3DscapeCS Movie - Substrate-product Net- 
work Motion Visualization. 

Additional file 7: The 3DscapeCS Movie - Motion Network 
Visualization of MS Experiments. 
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